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Abstract — We propose a new method to embed data in binary 
images, including scanned text, figures, and signatures. Our 
method rehes on the concept of wet paper codes. The shuffling 
before embedding is used in order to equalize irregular embed- 
ding capacity from diverse areas in the image. The hidden data 
can be extracted without the original binary image. We illustrate 
some examples of watermarked binary images after wet paper 
coding. 

I. Introduction 

A great majority of works on image watermarking (WM) 
has been devoted to color or gray scale images where pixel 
luminances may take values on a wide range. Then slight 
changings of even most of pixel luminance values may not 
be noticeable under normal viewing conditions. 

Data hiding on digital binary images (BI) has been inves- 
tigated in |1|. The main characteristic of this approach is the 
notion of flippable pixels which are precisely those that can 
be inverted (flipped) without noticeable corruption of BI. In 
our scheme, a bit string is embedded in each block based on 
the flippability score of the pixel within the block. This score 
is measured by the changes in smoothness and connectivity. 
A shuffling table or a key is required for correct extraction 
of the bit string watermark. In ||2|, Wu et al. have presented 
a flippability score computation method by considering the 
smoothness and connectivity of the cover image. In their 
work, the score is determined by considering the changes in 
smoothness and connectivity and it is reached by the proposed 
rules. However, the involved analysis is quite extensive for 
neighborhoods larger than 3x3 pixels. Another flippability 
score computation method was proposed in |3|. That method 
is based on the reciprocal distance calculation, which matches 
well within subjective evaluation, and it is easy to extend 
to block neighborhoods larger than 3x3 pixels. In [4|, the 
authors use a weight matrix to calculate the new transitions 
created by the flipping. That approach calculates the distortion 
by combining the changes in smoothness and in connectivity. 
Maxemchuk et al. ||5| changed line spacing and character 
spacing in order to embed information in textual images for 
bulk electronic publications. This method cannot be easily 
extended to other binary images and the size of hidden data 
is very small. 

In all those approaches the embedding capacity is not large. 
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Fig. 1. 3x3 patterns with flippable center pixels. 

In order to deal with this limitation, an approach has been 
proposed, called writing on wet paper |6|. 

In our proposed procedure, we exploit the notion of flippable 
pixels, as presented in [1|, and we combine it with the wet 
paper codes (WPC) given in |i6J. In section 2 we show a 
method for finding flippable pixels. Section 3 contains the 
description of the WPC technique. Section 4 contains some 
experimental results. Section 5 contains some conclusions and 
poses resulting open problems. 

II. WM Embedding Method for BI 

It was mentioned already that it is rather difficult to embed a 
WM into a BI without noticeable distortion. In f\\ an approach 
has been developed to look through each of the adjacent 
N X N areas of pixels and to embed a "0" in an area by either 
changing or not one of its pixels, so that the total number 
of black pixels in the area is an even number; similarly, to 
embed a "1", by either changing or not one pixel, so that the 
number of black pixels is an odd number If it is necessary to 
change one pixel in an area it has to be chosen as a flippable 
pixel. Obviously the area should be non-empty. We will specify 
flippable pixels in greater detail. 

Let us consider initially 3x3 areas with the intention to 
change only the central pixel (whenever it was necessary). In 
Figure [T] we present the patterns in which we will consider 
the central pixel as flippable, of course the central pixel of 
any pattern obtained by a rotation around the central axis or 
an inversion of those patterns is considered also as flippable. 
We remark that our choice of central-flippable patterns (CFP) 
is of a heuristic kind because we are mainly concerned in 
maintaining the connectivity features after a flipping. We may 
try to consider a larger area size, > 3, but this will involve 
the same 3x3 patterns displayed at Figure [T] Namely, we will 
shift a 3 X 3 area, by one pixel in turn, in both horizontal and 



vertical directions, along the N x N area until all flippable 
pixels are found. 

III. Writing of Wet Paper for the Case of Binary 
Images 

This name derives from a real situation, when the paper has 
been exposed to rain. The sender is able to write the message 
only over dry places. During transmission the paper dries out 
and the recipient does not know, which places were dry when 
encoding. However he is able to read the message easily. In 
information theory this technique is known as "writing in 
memory with defective cells". 

A. Encoder 

Let us assume that the sender has a binary column vector 
b = received from a binary image of n pixels (a black 

pixel gives value 1, and a white pixel gives 0), and an index 
set C C {1, . . . , n}, |C| = k, corresponding to those bits that 
are flippable and subject to be modified in order to embed 
a message. The sender wants to communicate a sequence 
of q bits, m = (mi,...,mg). First, let us assume that the 
recipient knows q and let us postpone later the case when the 
recipient does not know q. The sender and recipient use a 
shared stegokey to generate a pseudo-random binary {q x n)- 
matrix D. The sender shall modify some flippable bits bi, 
i £ C, m such a way that the modified binary column vector 
b' — {h[)^^-^ satisfies Dh' = m. Let v = b' — b, hence the 
support of vector v is a subset of C, then 

i:>v = m - L>b (1) 

Given m and b, the system of linear equations ([T]i should be 
solved for k unknowns Vi, i Q C (clearly, Vi = whenever 
i ^ C). By deleting the rows and columns with indexes in the 
complement of C, the system ([T]) can be rewritten as 

HV = m - Db (2) 

with H of order {q x k) and v' = (I'Oigc ^ fc-dimensional 
vector The equations system (j2]l is posed in the Galois field 
GF(2) and it has an unique solution whenever rank(i/) — q. 
Thus, in order to decide which flippable pixels should actually 
be flipped, the encoder should solve the equations system 

B. Decoder 

The modified binary image b' is sent to the recipient. The 
decoding is very simple because the recipient may recover the 
message m — Dh' using the shared matrix D. Note that the 
recipient is not required to know which pixels are flippable. 

We now explain how to relax the assumption that the 
recipient knows q. The sender and the recipient can agree 
matrix D in a row-by-row manner rather than as a whole 
planar {q x n)-array of bits. The sender may reserve the first 
[log2 n] bits of the message m as a header to inform the 
recipient of the number of rows in D (the symbol \x~\ denotes 
the smallest integer greater or equal than x). The recipient 
takes the first [logj n] rows of D, and applies the resulting 
( [log2 n] x n)-submatrix to the received vector b', in order to 
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TABLE 1 

The numbers of image areas Na, flippable pixels in the image 

Npp, AND EMBEDDED BITS INTO AN IMAGE Np- 



recover the header, i.e. the message length q. Then, he uses 
the rest of D to recover the message m = Dh' . 

C. Practical implementation 

Due to the great computational complexity of solving a 
system (j2| for a big value of q, we partition our image into 
areas of N pixels, and for each we solve a corresponding 
system Q of linear equations. Besides, since there is no a 
regular distribution of flippable pixels from area to area, we 
use shuffling of the image pixels as it was suggested in Q. 

If the length of the embedded message is greater than the 
number of flippable pixels in the area, we try to embed as 
many bits as possible. The remaining bits would be embedded 
into following areas. As remarked before, whenever condi- 
tion rank(i?) = g is fulfilled for an area, we may embed 
k — [log2 n] bits in that area. Otherwise, we may decrease q 
until condition rank(_ff) ~ q\s satisfied. 

IV. Experimental results 

We took n — 4096 pixels as the area size. In Table |l] we 
present the numbers Na, Npp, Np corresponding to image 
areas, flippable pixels in an image and embedded bits for three 
typical images. 

The three processed images, with actual size of about 
300 X 225 pixels, are presented in their original forms (before 
watermarking) and after watermarking in Figure |2] 

V. Conclusions 

It seems to be difficult the embedding into a BI of an 
additional information without remarkable distortions of the 
original image. But as it can be appreciated in Table |l] and 
Figure |2] it is indeed possible to embed a significant number 
of information bits without any visual difference among the 
original images and their watermarked copies. 

It is an interesting open problem to find an efficient sta- 
tistical method in order to distinguish the original BI from 
the watermarked BI. Solutions to this steganography problem 
are well known fS), f9l for gray-scale (or color) images but 
they have a poor adequacy for BI. Another interesting open 
problem is to develop a technique for selective authentication 
of BI when there is a group of legitimate and illegitimate 
distortions of the originals. If it occurs that distortions belong 
to the first group, then authentication is confirmed, while if 
it occurs at the second group, then authentication fails. The 
selective authentication is especially important for BI because 
the legitimate distortion can be resulted from channel noises 
or corruption after scanning of paper documents or images 
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Fig. 2. Images 1, 2, 3 in (a) original and (b) after watermarking with WPC. 



received by fax. We are going to present our contribution in 
this direction in the near future. 
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